Computationally Effic Modification of Speech Us
Authors
Abstract
Among the conventional time-scale modification methods [1][6], the synchronized overlap-and-add (SOLA) method [4] is widely used because it performs well at relatively low computational complexity. However, SOLA still spends much of its computation evaluating the normalized cross-correlation function in the synchronization procedure [9]. In this paper, we employ a three-level center-clipping method to reduce the computational complexity of SOLA. The results of a subjective preference test indicate that the proposed method can reduce computational complexity by over 80% compared with conventional SOLA without considerable performance degradation. We also apply the variable time-scale modification method based on transient information [7] to the proposed algorithm. Doing so maintains the intelligibility of time-scale-modified speech even at very fast playback rates.
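The idea of replacing the cross-correlation with one computed on three-level clipped samples can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not give the clipping threshold, the search range, or the exact correlation measure, so the function names, the `ratio` parameter (threshold as a fraction of the peak amplitude), and the fixed-length unnormalized score are all hypothetical choices made here.

```python
def three_level_clip(x, threshold):
    """Map each sample to +1, 0, or -1 (three-level center clipping)."""
    return [1 if s > threshold else -1 if s < -threshold else 0 for s in x]


def clipped_correlation(a, b):
    """Correlation of two clipped sequences.

    Every product is +1, 0, or -1, so the sum needs only comparisons
    and additions, with no multiplications.
    """
    total = 0
    for p, q in zip(a, b):
        if p != 0 and q != 0:
            total += 1 if p == q else -1
    return total


def best_lag(output_tail, frame, max_lag, ratio=0.3):
    """Return the lag in [0, max_lag] that best aligns frame with output_tail.

    Assumption: the threshold is `ratio` times the peak magnitude of both
    signals, and every lag is scored over the same window length n so the
    unnormalized scores are comparable.
    """
    n = len(output_tail) - max_lag
    if n <= 0 or len(frame) < n:
        raise ValueError("signals too short for the requested max_lag")
    peak = max(max(abs(s) for s in output_tail), max(abs(s) for s in frame))
    threshold = ratio * peak
    ref = three_level_clip(output_tail, threshold)
    cand = three_level_clip(frame, threshold)[:n]
    scores = [clipped_correlation(ref[k:k + n], cand) for k in range(max_lag + 1)]
    return max(range(max_lag + 1), key=lambda k: scores[k])
```

In a SOLA-style loop, `best_lag` would stand in for the normalized cross-correlation search that positions each new frame before the overlap-add step. How much computation this saves depends on the frame length and search range, which this sketch does not try to reproduce.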
Similar resources
Real Time Prosody Modification
Real time prosody modification involves changing the prosody parameters such as pitch, duration and intensity of speech in real time without affecting the intelligibility and naturalness. In this paper prosody modification is performed using instants of significant excitation (ISE) of the vocal tract system during production of speech. In the conventional prosody modification system the ISE are...
A Unix-based Speech Data Collection Platform
It is highly desirable to collect speech data from the telephone network via a digital interface. This avoids an additional A/D conversion normally required by analog telephone data collection hardware. A popular solution to this problem is the use of a T1 line which offers 24 digital phone lines. The leading T1 interface for Sun workstations is a system developed by Linkon Corporation. Using t...
An Overlap-add Technique Based on Waveform Similarity (WSOLA) for High Quality Time-scale Modification of Speech
A concept of waveform similarity is proposed for tackling the problem of time-scale modification of speech, and is worked out in the context of short-time Fourier transform representations. The resulting WSOLA algorithm produces high quality speech output, is algorithmically and computationally efficient and robust, and allows for on-line processing with arbitrary time-scaling factors that may b...
Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals
Time- and pitch-scale modifications of speech signals find important applications in speech synthesis, playback systems, voice conversion, learning/hearing aids, etc. There is a requirement for computationally efficient and real-time implementable algorithms. In this paper, we propose a high quality and computationally efficient time- and pitch-scaling methodology based on the glottal closure inst...
Overlap-add methods for time-scaling of speech
In this tutorial on time scaling we follow one particular line of thought towards computationally efficient high quality methods. We favor time scaling based on time-frequency representations over model based approaches, and proceed to review an iterative phase reconstruction method for time-scaled magnitude spectrograms. The search for a good initial phase estimate leads us to consider synchro...
Publication date: 2002